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Abstract 

This study applies a sliding-mode-based neural network to 
control inverted pendulum systems. Neural network 
weights are updated using a cost function which denotes 
distance from the sliding manifold. Thus, minimizing the 
cost function equals reaching the sliding surface. Sliding 
mode based neural network also makes the system robust to 
uncertainties in parameters and dynamical uncertainties. 
Chattering effect is solved by modifying the cost function. 
Simulations are fulfilled for a SISO and a MIMO model of an 
inverted pendulum. The results of simulations reveal the 
effectiveness of proposed method. 
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Introduction 

Due to hardship in obtaining control efforts in some 
applications, adaptive controllers like neural networks 
have been widely used [Bose (2007)]. In a neural 
controller, weights which construct the control effort 
are updated in order to minimize a cost function. One 
way to combine good features of classical control 
theory with neural networks is deriving a cost 
function which yields classical control goals. Cost 
function can be depicted as distance from a sliding 
manifold. This way, minimizing cost function is equal 
to approaching desired state values. 

One drawback of sliding mode control is the 
chattering effect. The chattering is generally 
undesirable because it involves extremely high control 
activities and may excite high-frequency dynamics 
neglected in modelling [Slotine & Li (1991)]. 

In [Yildiz et al. (2007)], the signal control obtained 
from AD ALINE neural network is updated using a 
cost function which denotes distance from the sliding 
manifold. Thus, merge good features of sliding mode 
and neural network. The method is applied to physical 


model of the linear servo drive. 

In [Wang et al. (2002)], a supervisory controller is 
appended into the FNN controller to force the states to 
be within the constraint set. Therefore, if the FNN 
controller cannot maintain the stability, the supervisory 
controller starts working to guarantee stability. On the 
other hand, if the FNN controller works well, the 
supervisory controller will be deactivated. The method 
is applied to an inverted pendulum system. We use 
this system for our simulations. 

Recently [Kayacan et al. (2013)], the control of a 
spherical rolling robot by using an adaptive neuro- 
fuzzy controller in combination with a sliding-mode 
control (SMC)-theory-based learning algorithm has 
been presented. The proposed control structure 
consists of a neuro-fuzzy network and a conventional 
controller which is used to guarantee the asymptotic 
stability of the system in a compact space. 

In this study the sliding-mode-based neural controller 
is applied to SISO and MIMO model of an inverted 
pendulum. The cost function is derived from 
Lyapunov stability criteria. As the cost function 
becomes smaller, outputs tend to the desired values 
and weights updating decreases. Simulations are 
brought afterwards. 

This paper is organized as follows: Problem 

statements given in Section 2. Section 3 presents the 
structure of neural network. Simulation examples to 
demonstrate the performance of the proposed method 
is provided in Section 4. Section 5 gives the 
conclusions of the advocated design methodology. 

Problem Statement 

Model of the System 

In this paper dynamics of a system including m 
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subsystems is considered. Dynamics are described by 
y” 1 1 = hi + bjUi + gi, wherey.* 1 is the Zth derivative of y t 
considering x = [y^yi, ....y"' -1 , ...,y” i_1 ] 7 ’ as 

state vector. The subsystems can be described as 
follows: 

x = f{x ) + Bix ) + d ( 1 ) 

In which x T E $R n is the state vector, n = Yih^i , 
u E is the control vector and fix ) E 91 n is an 
unknown, bounded and continuous function. Bix ) E 
foe input matrix, with continuous and 
bounded elements and rank{Bix))\y x = m . d E 91 n 
describes output disturbance, assumed bounded. All 
elements of fix ) E $R n and d E $R n are bounded. Fully- 
actuated mechanical systems can be described in the 
form of equation (1). 

Control Design 

Control law is derived from SMC structure. First, an 
appropriate sliding mode is selected to ensure 
dynamics' convergence to desired values. Control 
signal should be derived such that Lyapunov 
conditions are satisfied. Selecting the Lyapunov 
function using sliding mode is a natural and 
reasonable approach to get to the desired control goals 
that is tracking desired trajectory. 

Sliding mode 

For system described in equation (1), one choice for 
the sliding mode is 

o = Ge t = 0 ( 2 ) 

The tracking error vector is defined as e t = 

[e 1( ... , e{ ni-1) , ... , e m , , e^ 1 m_1) ] E 5R n , in which e* = 
y d . - y t , u = [o lf ... , o m ] T E and G E $l mxn . Matrix 
Dhas to be Hurwitz, to damp tracking error and its 
derivatives. Thus each elements of the vector u(e)is a 
function of output error, u. = a ij ^ with a t j > 
0, a tl = 1 describes function of tracking error and its 
derivatives, which has some roots in left half of s plane. 

Deriving Control Signal 

One Lyapunov function candidate is: 

V = Va (3) 

where V E 91. We can also assume V = (1/2) || cr||| in 
which || . || 2 reveals Euclidean norm with initial 
condition^ (0) = 0. Time derivative of the Lyapunov 
function has to be negative definite to ensure stability. 
We can equate V to a negative definite function, as 
following: 


T 0 T G 

V = -o t Dg-ii-—- (4) 

II ^ II 2 

Dis a symmetric positive definite matrix and {i > 0. 
Replacing (3) in (4), we have 

at ( &+Da+ti ¥f) = 0 (5) 

For o A 0, the control law is derived from: 

( d+D(T+ ^) = 0 (6) 

Thus sliding mode conditions are satisfied. 

Discontinuous term should be small enough in order 
to prevent the chattering effect in SMC. Because 
simulations are performed in discrete form, we can 
neglect the discontinuous term. Thus control signal is 
selected such that (d + Do) = 0. For further analysis, 
(Du) can be replaced by (Du + iig/o t o). 

For a system in the form of (1) and a sliding surface as 
(2), a control signal satisfying (u + Du) = Ois: 

u = — (GR) -1 (G(/ + d - x d .) - Du) = u eq + iGB)~ 1 Da (7) 
Where x d = [y dl , ...,y^ 1_1) , ...,y dm , ...,y^” m-1) ] , u eq is 
called equivalent control and is derived fromu = 0. 

In most of the works in literature which are combined 
Neural Networks with SMC, u eq is derived from a 
neural network [Kaynak et al. (2001), Morioka et al. 
(1995), Jezernik et al. (1997)]. Though, here we use a 
one-layer MLP network to get the whole control signal. 

Neural Network Structure 

One Layer MLP 

The structure used for the neural network in this study 
is shown in fig. 1. 



FIG. 1 NEURAL NETWORK STRUCTURE 

e t . is the zth row of e t vector, w t j shows weight between 
the zth and the ;th nodes and w i0 denotes bias term. 
Control inputs are defined as: 
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n 

Ui = ^ e t .w tj + 1 w l0 , i = 1, ... , m. (8) 

;'=i 

In fig. 1 activation functions are linear and the neural 
network is static. Equation (8) denotes a PD controller 
for a second-order system. For higher order systems, it 
denotes a state feedback controller. From (8), when 
tracking errors are zero, control signal equals the bias 
term, which rejects disturbance effect. 

In this paper, weights update is selected such that(d + 
Da) = 0. Thus Lyapunov conditions are satisfied. The 
cost function used is as follows: 

1 

E = - (a + Da) T (a + Da) (9) 

As E -> 0 with weights update, (a + Da) = 0 is 
satisfied, the states move on the sliding surface and 
converge to the desired values. 

Control law is derived from SMC structure. First, an 
appropriate sliding mode is selected to ensure 
dynamics' convergence to desired values. Control 
signal should be derived such that Lyapunov 
conditions are satisfied. Selecting the Lyapunov 
function using sliding mode is a natural and 
reasonable approach to get to the desired control goals 
which is tracking desired trajectory. 


Updating Weights 

Weights are updated as following: 

dE 

(10) 

r] > 0 is the learning coefficient. Using Chain rule, we 
have 


Wij = -rj 


dE du t 
du t dwij 


da 


% 


Wij = -T](a + Da) T — e tj 

r . , n d(Gx d - Gx ) 
w„ = -,(<, + BaV — ^ — 

If we rewrite equation (1) as 

x = /(*) + [B^x) : ••• ! £ m O)] i +d 

Mm 

Replacing (14) in (13) concludes 

= —r] (<j + Da) T GB i (x)e t . 

For updating bias weights, w i0/ we have 


( 11 ) 

( 12 ) 

(13) 

(14) 

(15) 


w i0 =r](a + Da) T GBi(x) (16) 

If we select nonlinear activation functions instead of 
linear ones, updating weights is similar to the above 
procedure, though updating terms are multiplied by 


the derivative of the activation function, that is 
w l} = 77 ( 0 - + Day c/iGBiWet.. 

In the above approach when the cost function equals 
zero, that is (a + Da) = 0, updating stops and the 
states reach the desired values. In [Yildiz et al. (2007)], 
It is shown that the minimum of cost function is global, 
because the second derivative of cost function always 
stays positive. 


Stability 


The Lyapunov candidate is selected as 
1 

V = -(d + Dg) t (a + Da) (17) 

It can be easily shown thatk > 0, while (a + Da) A 0. 
When (a + Da) = 0, we havek = 0. Differentiate the 
above equation, we have 


*-11 

7 = 1 /=n 


dV dwij 
dw t j dt 


+5(7)7 


(18) 


g(y ) is the derivative of V on other parameters. 
Replacing (10) in (24), we have 


m n 2 

’ 7 = "ZZ(a4) +aiy) t (19) 

i-1 j = 0 v J/ 

To ensure stability, learning rate should be chosen 
large enough. Thus, derivative of Lyapunov function 
will be negative definite, V < 0. 


Simulations 


SISO Case 


Here we will apply sliding-mode-based neural 
controller for two cases. First we consider SISO case 
with an inverted pendulum system that is shown in fig. 
2 [Wang et al. (2002)]. 



If we choose*! = 6 to be the angle of the pendulum 
with respect to the vertical line, the dynamic equations 
of the inverted pendulum system are 


*i 

A 


f° 

lo 


°]0 


( 20 ) 
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Where 


/ = 


g v sin x 1 — {mlx\ cos x ± sin x t ) 


^ /4 mcos 2 x 1 \ 
\3 m c +m J 


cosx 1 

m c +m 


^ /4 mcos 2 x 1 \ 
\3 m c +m J 


> 0 


(21) 


g v = 9.8 meter /(sec 2 ) is the acceleration due to 
gravity, m c is the mass of the cart,J is the half-length of 
the pole, m is the mass of the pole and u is the control 
input. Here, we assume m c = 1kg , m = 0.1kg and 
l = 0.5 meter. For implementing sliding-mode-based 
neural controller on this continuous system, we should 
discretize it with a proper sampling time. Thus, 
updating states would be as following: 


X(k + 1) = X(k) + Ts x dX(k + 1) (22) 

Structure of system with controller is shown in fig. 3. 
Desired trajectory for angle of pendulum is a sinusoid. 
We assume that the neural network is a one-layer 
linear network. To control the angle of pendulum, the 
sliding manifold is chosen as a = e + Ce , where 
e = 6 r — 6 refers to the error in angle of pendulum. We 
select controller parameters asC = 2, D = 2 andry = 0.1, 
and the sampling time T s = 0.001s. 
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FIG. 3 STRUCTURE OF SYSTEM WITH CONTROLLER 



FIG 4 TRACKING ANGLE OF PENDULUM 



FIG 5 TRACKING ANGULAR VELOCITY OF PENDULUM 


e 

UJ 


Cost Function 






T T 


T T 

T 






0 0.5 1 1.5 2 


FIG 6 TRACKING ERROR OF PENDULUM 



FIG 7 CONTROL INPUT APPLIED TO CART 


Figs. 4-7 show the response of the system to the 
sinusoid reference input for angle. Fig. 4 shows the 
output trajectory 6 and reference output^, where the 
reference trajectory is tracked perfectly and error is 
hardly noticeable. Fig. 5 shows the second state of the 
system, which is angular velocity of pendulum and 
tracked perfectly similar to angle of pendulum. Fig. 6 
shows the tracking error in logarithmic axis. It can be 
seen, error decreases fast during tracking and remains 
in a limit bound for steady-state. In fig. 7 the control 
signal is shown, that is seen to be smooth. 

The presented results show that sliding-mode neural 
controller works suitably, and the states converge to 
sliding surface properly. This convergence is achieved 
by a simple weight update algorithm and an uncertain 
system with limited knowledge on system parameters. 


MIMO Case 


In MIMO case, we consider two inverted pendulums 
connected by a moving spring mounted on two carts 
(fig. 8) [Yang et al. (2010)]. In this system position of 
the pivot in moving spring is a function of time, which 
can change along the full length l of the pendulums. 
The inputs of the system are torque u { applied at the 
pivot point of each pendulum. The motion of carts is 
assumed to be sinusoid trajectories. Each pendulum is 
assumed as a decoupled subsystem of the whole 
system. The objective is to control the angle of each 
pendulum with only its information so that each 
pendulum tracks its own desired reference trajectory 
while the connected spring and carts are moving. 

If we define*; = \0 it 6i[ = \x n ,x i2 ] T ,i = 1,2, as the 
systems states, the dynamical equations of the coupled 
pendulums can be described as following: 


*i = 



0 

r 


r o 

9 

ka(t)(a(t) - cl) 

0 

X'l + 

l 

-cl 

cml 2 


-cml 2 - 


u ± 


0 0 
ka(t)(a(t ) — cl) 


cml 2 
m 

M 

/c(a(t) - cl) 


sinCxu) xl 2 


cml 2 


Oi - Vz) 


(23) 
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*2 



0 

r 


r o 

9 

ka(t)(a(t) - cl) 

0 

x 2 + 

l 

-cl 

cml 2 


.cml 2 . 


U 2 


0 0 
ka(t)(a(t ) — cl) 


cml 2 

— sin(x 21 )x| 2 

k(a(t) - cl) 

+ cmP 0/2 “ yi) 



(24) 


FIG 8 THE COUPLED DOUBLE INVERTED PENDULUM 

Where C = M/(M + m),/c and $ are spring and gravity 
constants and u x and u 2 are pendulums control 
torques. We choose g = 9.8, 1 = 1, k = 1, M = m = 4 
for simulations. The motions of the carts are assumed 
to be sinusoids, that is y 1 = sinCoqt) and y 2 = L + 
sin(a) 2 t), where L is the natural length of the spring 
and (x> 1 A co 2 . Here, we select oq = 2 , L = 2 andca 2 = 3. 
Also, we choose a(t) = sin St . Considering X = 
[x 1 ,x 2 ] t = [0 1 ,6 1 ,6 2 ,6 2 ] T as the whole system states, 
we can reach the standard form of equation (1). 

Again we consider sine-wave trajectories as desired 
angle of pendulums. We also assume one-layer linear 
network for sliding-mode-based neural controller. The 
sliding manifold for the first subsystem is chosen as 
a i = £i + Ce lr where e 1 = Q lr — Q 1 refers to the error in 
angle of first pendulum. For the second subsystem the 
sliding manifold is chosen as o 2 — e 2 + C e 2 , where 
s 2 — 0 2r — 0 2 refers to the error in angle of the second 
pendulum. We select controller parameters as C = 2, 
[2 1] 

D = ll 2-1' P os iti ve definite,!? = 0.4, and the 

sampling time T s = O.OOls.Both references are applied 
at the same time. 

Figs. 9-14 show the response of the system to the sine- 
wave reference for each angle. As shown in Figs. 9-10, 
the output trajectory 6 1 tracks the reference output 6 ±r 
perfectly, while simultaneously output trajectory 0 2 
tracks the reference output# 2r . Figs. 11-12 show the 
second states of the subsystem, angular velocity of 
each pendulum, which are tracked perfectly similar to 


the angle of pendulums. Fig. 13 shows the tracking 
error in logarithmic space, which remains bounded in 
steady-state. As it is seen in fig. 14 the control signals 
are bounded and well-behaved. 


The presented results show that sliding-mode-based 
neural controller works suitably for MIMO systems, 
where decoupled and interaction terms are assumed 
disturbance. All the states converge to sliding surface 
properly, and also the system is capable of coping 
with harmonic changes in references. 



FIG 9 TRACKING ANGLE OF FIRST PENDULUM 



FIG 10 TRACKING ANGLE OF SECOND PENDULUM 



FIG 11 TRACKING ANGULAR VELOCITY OF FIRST PENDULUM 



FIG 12 TRACKING ANGULAR VELOCITY OF SECOND 
PENDULUM 



FIG 13 TRACKING ERRORM OF PENDULUMS 
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Control Inputs 



FIG 14 CONTROL INPUTS APPLIED TO EACH PENDULUM 


Conclusion 

In this paper a neural network based on sliding mode 
is proposed for an inverted pendulum. Weight 
adaptation in the neural network uses a cost function 
derived from Lyapunov stability criteria. The aim in 
this study is to develop a learning method for 
parameters of neural controller that not only can be 
applied without the need for calculating Jacobean of 
plant but also guarantees stability and robustness of 
the learning approach. According to the simulations 
for SIS O and MIMO case, good tracking characteristics 
in outputs are obtained. 
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